首页> 外文OA文献 >Scalability and parallel execution of OmpSs-OpenCL tasks on heterogeneous CPU-GPU environment
【2h】

Scalability and parallel execution of OmpSs-OpenCL tasks on heterogeneous CPU-GPU environment

机译:异构CpU-GpU环境中Ompss-OpenCL任务的可伸缩性和并行执行

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

With heterogeneous computing becoming mainstream, researchers and software vendors have been trying to exploit the best of the underlying architectures like GPUs or CPUs to enhance performance. Parallel programming models play a crucial role in achieving this enhancement. One such model is OpenCL, a parallel computing API for cross platform computations targeting heterogeneous architectures. However, OpenCL is a low-level programming language, therefore it can be time consuming to directly develop OpenCL code. To address this shortcoming, OpenCL has been integrated with OmpSs, a task-based programming model to provide abstraction to the user thereby reducing programmer effort. OmpSs-OpenCL programming model deals with a single OpenCL device either a CPU or a GPU. In this paper, we upgrade OmpSs-OpenCL programming model by supporting parallel execution of tasks across multiple CPU-GPU heterogeneous platforms. We discuss the design of the programming model along with its asynchronous runtime system. We investigated scalability of four OmpSs-OpenCL benchmarks across 4 GPUs gaining speedup of up to 4x. Further, in order to achieve effective utilization of the computing resources, we present static and work-stealing scheduling techniques. We show results of parallel execution of applications using OmpSs-OpenCL model and use heterogeneous workloads to evaluate our scheduling techniques on a heterogeneous CPU-GPU platform.
机译:随着异构计算成为主流,研究人员和软件供应商一直在尝试利用GPU或CPU之类的最佳基础架构来提高性能。并行编程模型在实现此增强功能中起着至关重要的作用。这样的模型就是OpenCL,OpenCL是一种针对异构架构的跨平台计算的并行计算API。但是,OpenCL是一种低级编程语言,因此直接开发OpenCL代码可能很耗时。为了解决此缺点,OpenCL已与OmpSs集成,OmpSs是基于任务的编程模型,可为用户提供抽象,从而减少了程序员的工作量。 OmpSs-OpenCL编程模型处理单个OpenCL设备(CPU或GPU)。在本文中,我们通过支持跨多个CPU-GPU异构平台并行执行任务来升级OmpSs-OpenCL编程模型。我们将讨论编程模型及其异步运行时系统的设计。我们研究了四个OmpSs-OpenCL基准测试在4个GPU上的可扩展性,将其速度提高了4倍。此外,为了实现对计算资源的有效利用,我们提出了静态和窃取工作的调度技术。我们显示使用OmpSs-OpenCL模型并行执行应用程序的结果,并使用异构工作负载来评估我们在异构CPU-GPU平台上的调度技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号